Trees, Dendrograms and Sensitivity
نویسنده
چکیده
Dendrograms and minimum-weight spanning trees (MWST) are discrete structures which arise in clustering theory, networks, and where strategic choices are made between discrete options. The discrete structure of the solution tree varies discontinuously with respect to changes in the dissimilarity or cost matrix. Tarjan [1982] has obtained bounds on the changes to the elements of the cost matrix, where the structure is not altered. These results are extended to describe the sensitivity of the tree structure with respect to changes in the cost matrix. The sensitivity of the structure to the addition of a node and arcs is solved using potential loops in the tree. This phase of the solution process builds onto the solution to the original MWST. The full sensitivity analysis of a dendrogram to changes in the attribute matrix is very complex, due partly to the corresponding cost matrix being a function of the full attribute matrix. Statistical methods are used to compare changes in the tree structure with changes in the attribute matrix. Analytical results are obviously difficult to obtain. The Kruskal clustering algorithm is used on the similarity or cost matrix to construct both the MWST and the dendrogram. Thus the sensitivity of the dendrogram to the similarity or cost matrix corresponds to the sensitivity of the MWST for this method of clustering.
منابع مشابه
On the similarity of dendrograms.
A metric on binary trees is dehed to give the similarity of two dendrograms. One of the major desirable properties of the proposed treesimilaritymeasure is to clarify the decision ordering nature of biological trees. This metric is applied to evolutionary tree reconstructions and comparative embryo-genesis. The mathematical properties of this metric are discussed, and an algofithm is proposed t...
متن کاملInterpreting Evolutionary Trees
Content: Evolutionary Trees Evolutionary tree diagrams are branching diagrams (dendrograms) that depict the phylogenetic relationships between taxa based on shared derived characters (synapomorphies) that reflect common ancestry. Phylogenies depict an historical pattern of divergence and descent as series of branches; these branches merge at points representing common ancestry, which in turn ar...
متن کاملListening to viral tongues: comparing viral trees using a stochastic context-free grammar.
We suggest a probabilistic method for comparing the topological features of large phylogenetic trees. Using this method, we demonstrate that a stochastic grammar can generate three influenza-subtype (A H1, A H3, and B) hemagglutinin trees used in an earlier study, with statistically similar parameters. The proposed methodology is applicable to a broad class of problems that require comparison o...
متن کاملMinimising branch crossings in phylogenetic trees
Abstract: While phylogenetic trees are widely used in bioinformatics, one of the major problems is that different dendrograms may be constructed depending on several factors. Albeit numerous quantitative measures to compare two different phylogenetic trees have been proposed, visual comparison is often necessary. Displaying a pair of alternative phylogenetic trees together by finding a proper o...
متن کاملFlow of Information during an Evolutionary Process: The Case of Influenza A Viruses
The hypothesis that Mutual Information (MI) dendrograms of influenza A viruses reflect informational groups generated during viral evolutionary processes is put forward. Phylogenetic reconstructions are used for guidance and validation of MI dendrograms. It is found that MI profiles display an oscillatory behavior for each of the eight RNA segments of influenza A. It is shown that dendrograms o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002